ParsBERT: Transformer-based Model for Persian Language Understanding

نویسندگان

چکیده

The surge of pre-trained language models has begun a new era in the field Natural Language Processing (NLP) by allowing us to build powerful models. Among these models, Transformer-based such as BERT have become increasingly popular due their state-of-the-art performance. However, are usually focused on English, leaving other languages multilingual with limited resources. This paper proposes monolingual for Persian (ParsBERT), which shows its performance compared architectures and Also, since amount data available NLP tasks is very restricted, massive dataset different well pre-training model composed. ParsBERT obtains higher scores all datasets, including existing ones gathered ones, improves outperforming both prior works Sentiment Analysis, Text Classification, Named Entity Recognition tasks.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Natural Language Understanding for Grading Essay Questions in Persian Language

many intelligent systems are intended to communicate with users through natural language. Understanding the natural language by the computer is one of the most essential operations in natural language processing. One of the applications of natural languages is in the exams having essay questions. The objective of this paper is to propose a method for designing an examiner machine and creating a...

متن کامل

Cooperative model-based language understanding

In this paper, we propose a novel Cooperative Model for natural language understanding in a dialogue system. We build this based on both Finite State Model (FSM) and Statistical Learning Model (SLM). FSM provides two strategies for language understanding and have a high accuracy but little robustness and flexibility. Statistical approach is much more robust but less accurate. Cooperative Model ...

متن کامل

A POS-Based Word Prediction System for the Persian Language

Word prediction is the problem of guessing the words which are likely to follow in a given text segment by displaying a list of the most probable words that could appear in that position. In this research, we designed and implemented three word predictors for Persian. Our baseline is a statisticalbased system which uses language models. The first system uses word statistics; in the second one w...

متن کامل

Exploring Model Building for Natural Language Understanding

In this paper the use of model building for natural language understanding in practical systems is investigated. After outlining several interpretation tasks where model construction would be useful, we perform experiments using two state-of-the-art model builders on the interpretation of imperatives in discourse. The results are acceptable for small discourses and small domains, but don’t scal...

متن کامل

Model Building for Natural Language Understanding

This paper introduces a discipline in the field of automated reasoning that has received too little attention so far within computational semantics: model generation or model building for first-order logic. Model builders offer a positive handle on the satisfiability problem (theorem provers are, in general, unable to detect the satisfiability of a problem) and are therefore often used in tande...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neural Processing Letters

سال: 2021

ISSN: ['1573-773X', '1370-4621']

DOI: https://doi.org/10.1007/s11063-021-10528-4